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Background and objective: The advent of next-generation sequencing has significantly facilitated character- 
ization of the oral microbiome. Despite great efforts in streamlining the processes of sequencing and data 
curation, upstream steps required for amplicon library generation could still influence 16S rRNA gene-based 
microbial profiles. Among upstream processes, DNA extraction is a critical step that could represent a great 
source of bias. Accounting for bias introduced by extraction procedures is important when comparing 
studies that use different methods. Identifying the method that best portrays communities is also desirable. 
Accordingly, the aim of this study was to evaluate bias introduced by different DNA extraction procedures on 
oral microbiome profiles. 

Design: Four DNA extraction methods were tested on mock communities consisting of seven representative 
oral bacteria. Additionally, supragingival plaque samples were collected from seven individuals and divided 
equally to test two commonly used DNA extraction procedures. Amplicon libraries of the 16S rRNA gene 
were generated and sequenced via 4 54-pyro sequencing. 

Results: Evaluation of mock communities revealed that DNA yield and bacterial species representation varied 
with DNA extraction methods. Despite producing the lowest yield of DNA, a method that included bead 
beating was the only protocol capable of detecting all seven species in the mock community. Comparison of 
the performance of two commonly used methods (crude lysis and a chemical/enzymatic lysis + column-based 
DNA isolation) on plaque samples showed no effect of extraction protocols on taxa prevalence but global 
community structure and relative abundance of individual taxa were affected. At the phylum level, the latter 
method improved the recovery of Actinobacteria, Bacteroidetes, and Spirochaetes over crude lysis. 
Conclusion: DNA extraction distorts microbial profiles in simulated and clinical oral samples, reinforcing the 
importance of careful selection of a DNA extraction protocol to improve species recovery and facilitate data 
comparison across oral microbiology studies. 
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The oral cavity harbors complex microbial com- 
munities. Studies using various molecular techni- 
ques have helped unravel the diversity of the oral 
microbiome at different sites, providing new insights on 
how microbial communities change in relation to oral 
diseases (1-5). Nevertheless, the use of these culture- 
independent approaches has limitations related to biases 



introduced by DNA extraction, polymerase chain reac- 
tion (PCR), and sequencing (6-9). The influence of DNA 
extraction protocols on the characterization of microbial 
communities has been described as a key issue in studies 
of the human microbiome of the respiratory tract (10) 
and colon (11, 12) and in environmental samples (13). 
Regarding the oral microbiota, the importance of DNA 
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extraction has also been discussed (6, 14); however, to 
our knowledge, there are no reports that systematically 
evaluate the effect of DNA extraction on the character- 
ization of oral microbial communities. Thus, the aim 
of this study was to evaluate the performance of four 
different DNA extraction protocols on a mock microbial 
community and to compare the microbial content of 
supragingival plaque samples after applying two DNA 
extraction methods commonly used in oral microbiome 
studies (1, 5, 6, 15, 16). 

Methods 

Mock community assembly 

Mock communities were prepared as previously described 
(6) by mixing seven representative oral bacteria (Strep- 
tococcus oralis 34, Streptococcus mutans NCTC 10449, 
Lactobacillus casei LR1, Actinomyces oris T14v, Fuso- 
bacterium ATCC 10953, Porphyromonas gingivalis ATCC 
33277, and Veillonella parvula PK 1910). Bacteria grown in 
liquid cultures were quantified using a Petroff-Hausser 
counting chamber, diluted in PBS and mixed in equal 
proportions to a final concentration for each species of 10 7 
cells uL _1 . Cell mixtures (1 uL) were then centrifuged 
(3,300 xg), pellets resuspended in 100 uL of TE buffer 
(10 mM Tris-HCl pH 7.4, 1 mM EDTA) and stored at 
-80°C. 

Human sampling procedures 

Seven subjects participated in this study. Subjects were 
enrolled at the Dental Clinic of the Faculty of Dentistry 
at the University of Chile, under an approved protocol. 
The individuals were systemically healthy, with no anti- 
biotic use in the previous 3 months and were period- 
ontally healthy as determined by no site with probing 
depths >3 mm and < 10% of sites presenting bleeding 
on probing. A supragingival plaque sample was collected 
using a sterile curette from the lingual aspect of a first 
mandibular molar, placed in a vial with 100 uL of TE 
buffer and stored at — 80°C until processing. 

DNA extraction 

Four DNA extraction methods, described below, were 
used in this study. 

Crude chemical/enzymatic lysis method (C) 

Tween 20 (Sigma-Aldrich, St. Louis, MO) at a final 
concentration of 0.5% v/v and Proteinase K (Promega, 
Madison, WI) at a final concentration of 0.2 mg mL" 1 
were added to samples, which were then incubated at 55°C 
for 2 h and then at 95°C for 5 min. One microliter of this 
lysate was directly used in PCR reactions. This protocol 
has been widely used for the characterization of oral 
microbial communities (1, 15, 16). 



Chemical/enzymatic lysis + DNeasy Blood 
and Tissue kit (Q) 

Samples were mixed with lysozyme (final concentration 20 
mg mL -1 ) and incubated at 37°C for 30 min. Buffer AL 
(from the DNeasy Blood and Tissue kit, Qiagen Valencia, 
CA) and Proteinase K (to a final concentration of 1 .23 mg 
mL ~~ l ) were then added and samples were incubated over- 
night at 56°C. The following day, samples were incubated 
for 5 min at 95°C. DNA isolation was then performed 
using the DNeasy Blood and Tissue kit (Qiagen) in 
accordance with the manufacturer's instructions and 
DNA was eluted in MD5 solution (MoBio laboratories, 
Carlsbad, CA). This protocol has been used by our group 
for the characterization of the subgingival, mucosal, and 
salivary microbiomes (5, 6). 

Chemical/enzymatic lysis + boiling + DNeasy 
Blood and Tissue kit (QB) 

This method consisted of a modification of the Q 
protocol, in which an additional step of heating at 100°C 
for 10 min was introduced immediately after Proteinase 
K inactivation. Boiling of samples has been previously 
shown to increase the representation of species that are 
difficult to lyse, such as Actinomyces (7). 

Bead beating + Fast DNA Spin Kit (BB) 
Samples were added to a bead beating matrix containing 
0.4 g Lysing Matrix B (MP Biomedicals, Santa Ana, CA) 
supplemented with 1 g of very high density 0.5 mm yttria- 
stabilized zirconia (95% Zr02 + 5% Y2O3) grinding media 
(Glen Mills Inc., Clifton, NJ). The FastDNA Spin 
Kit (MP Biomedicals) was then used according to the 
manufacturer's protocol (for yeast, algae or fungi) with 
three modifications: (1) decreased Cell Lysis Solution for 
Yeast (CLS-Y) to 800 uL to allow for sufficient air in the 
tube for thorough homogenization; (2) three homoge- 
nizations at speed 5.0 for 30 sec in the FastPrep-24 
Instrument (MP Biomedicals) separated by 5 min on ice 
to keep samples cool (17), instead of a single homogeniza- 
tion step; (3) the addition of a second wash with Salt 
Ethanol Wash Solution (SEWS-M) to facilitate removal 
of lysing solutions, which we found to persist and form 
precipitates in eluted gDNA if only one wash was per- 
formed as in the protocol. We also used a pre-heated 
incubator (instead of a water bath or heat block) during 
DNA elution. The lysis solution used with this kit con- 
tains detergents and salts only. This protocol was inves- 
tigated as a possible common one for both bacterial and 
fungal DNA extraction, since research from our labora- 
tories (unpublished) has demonstrated its effectiveness 
in releasing DNA from fungi in oral samples. 

DNA was isolated from replicate mock communities 
(independently prepared) using the four methods de- 
scribed above and from equally divided plaque sam- 
ples using the C and Q methods. All DNA extraction 
procedures were performed in parallel with a negative 
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control (TE buffer). DNA concentrations were mea- 
sured using a NanoDrop Instrument (Thermo Scientific, 
Willmington, DE). 

Library construction, sequencing, and data 
processing 

Amplicon libraries of 16S rRNA gene VI -V2 hyper- 
variable regions were prepared in triplicate from each 
extracted sample, combined and sequenced using 454 
Titanium chemistry (Roche, 454 Life Sciences, Branford, 
CT) according to protocols previously described (6). 
Mothur was used for processing of sequences (18), 
following the pipeline developed by Schloss et al. (19), 
as modified by our group (6). Individual sequences 
were assigned a taxonomy using mothur's version of 
the Ribosomal Database Project (RDP) classifier (20) 
and the Human Oral Microbiome Database (HOMD) 
(21, 22) and RDP database (version 9) as templates, with 
an 80% bootstrapping cutoff. For mock community 
analysis, sequences were binned according to their tax- 
onomical assignment. For plaque samples, sequences 
were clustered into operational taxonomic units (OTUs) 
(at 97% similarity) and each OTU was assigned a taxo- 
nomy, which corresponded to the taxonomic assignment 
for the majority of sequences within the OTU. When 
a consensus assignment at the species level was not 
possible, the highest taxonomical order in which a con- 
sensus was reached is reported. In this case, the repre- 
sentative sequence for each OTU (middle sequence) was 
also compared to the HOMD and its taxonomy reported 
in parentheses if this comparison resulted in >97% 
homology to a HOMD Oral Taxon (OT). Sequences 
were submitted to the Short Reads Archive (SRP038999 
for supragingival plaque samples and SRP039007 for 
mock communities). 

Statistical analyses 

Differences between the Q and C protocols when applied 
to supragingival plaque samples were evaluated using 
paired Mests for a-diversity metrics and with Analysis of 
Molecular Variance (AMOVA) (23) for P-diversity com- 
parisons. Differences in the relative abundance of OTUs, 
genera, and phyla were assessed with Wilcoxon Signed 
Rank tests with significance thresholds adjusted using 
the Benjamini-Hochberg false discovery rate method. 

Results and discussion 

Evaluation of four DNA extraction protocols 
using a mock community 

DNA yield from mock communities varied according 
to the method used. DNA yield of samples processed with 
the C protocol could not be determined due to the nature 
of the product (crude lysate with no DNA purification 
step). The yield of DNA was 215-295 ng for the 



Q method, 210-330 ng for the QB method, and 75-95 
ng for the BB method. These results are in agreement 
with those of Yuan et al. (24), who generally observed 
higher DNA yields from defined species using chemical 
and enzymatic lysis prior to DNA isolation than when 
these methods were supplemented with bead beating. 

Samples extracted with all methods yielded PCR 
amplicons, which were sequenced using 454 technology 
with an average of 5,657 + 930 sequences per library. As 
expected, bacterial species representation varied accord- 
ing to DNA extraction procedure. As Fig. 1 shows, an 
over-representation of F. nucleatum was observed with 
the C protocol, while the rest of the species, with the 
exception of V. parvula, were under-represented with this 
method. The Q method resulted in over-representation 
of S. oralis, while other Firmicutes such as S. mutans, 
V parvula, and L. casei were under-represented, as was 
P. gingivalis. The addition of a prolonged boiling step to 
the Q method (Q + B) had the most noticeable effect on 
V. parvula, which increased after boiling, and S. oralis, 
which decreased in comparison to Q alone. Importantly, 
the method that included bead beating (BB) was the only 
one able to detect all species in the mock community 
(L. casei only appeared with BB). 

These results are in agreement with other studies, which 
have observed that species proportions vary according 
to DNA extraction procedures (10, 12, 24). Use of 
lysozyme is thought to improve accuracy of community 
profiles, due to the ability of the enzyme to disrupt 
peptidoglycans in cell walls, thereby enhancing nucleic 
acid recovery from Gram-positives. Indeed, inclusion of 
lysozyme in our study (Q and Q + B methods) generally 
improved species representation compared to chemical 
lysis and Proteinase K treatment alone (C), although some 
Gram-positive species such as S. mutans and L. casei, and 
also the Gram-negative P. gingivalis, remained greatly 
under-represented. Use of additional enzymes may there- 
fore be necessary to improve accuracy, as suggested by 
Yuan et al. (24), who found that a cocktail containing 
lysozyme, mutanolysin, and staphilysin performed better 
than lysozyme alone. 

Despite low DNA yields, the only method that recov- 
ered all species was the BB protocol, which was designed 
by our group for efficient isolation of fungal DNA from 
oral samples (25). This method, however, also yielded a 
high proportion of contaminant sequences, all of them 
classified as Staphylococcus aureus (see 'other' column 
in Fig. 1). Contamination issues have been previously 
reported in sequenced mock communities and are attrib- 
uted to reagent impurity (10). This result highlights the 
importance of including reagent controls in every 
run. Nevertheless, the inclusion of a bead beating step 
seems to be beneficial in improving accuracy in species 
representation. Our data are again in agreement with 
Yuan et al. (24), who reported that although the DNA 
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Fig 1. Species representation in a mock community treated with four different DNA extraction methods prior to amplification 
and sequencing. Graph depicts the expected and obtained relative abundances for seven bacterial species after DNA extraction 
with a crude chemical/enzymatic lysis protocol (C), chemical/enzymatic lysis + DNeasy Blood and Tissue kit (Q), chemical/ 
enzymatic lysis + boiling + DNeasy Blood and Tissue kit (QB), and bead beating + Fast DNA Spin Kit (BB). The expected 
relative abundance was calculated taking into account cell input, copy number of 16S RNA molecules for each species and 
known PCR bias as previously determined by our group (6). 



yield of methods that included bead beating tended to 
be low, this procedure improved accuracy of community 
portrayals. 

Evaluation of the supragingival plaque microbiome 
using methods Q and C 

Seven supragingival plaque samples were split and 
processed with two DNA extraction methods (C and Q 
protocols), commonly used in the oral microbiology 
literature (1, 5, 6, 15, 16). This dataset was comprised 
of 99,883 sequences (after processing), with an average 
of 7,135 + 2,710 sequences per sample. A total of 436 
OTUs (defined at 97% homology level) were found across 
all samples, with a range of 96-193 OTUs per sample. 
Richness coverage according to CatchAll (26), ranged 
from 54.8 to 84.4%. 

Pairwise comparisons of a-diversity metrics between 
samples processed with the C and Q methods showed 
no significant differences (data not shown). When samples 
were compared based on a P-diversity metric that evalu- 
ates distances among communities based on the presence/ 
absence of OTUs (Jaccard metric), DNA extraction was 
shown not to be an influencing factor on community 
composition (AMOVA, P = 0.968), with samples cluster- 
ing by subject rather than by method (Fig. 2A). This result 
agrees with previous findings from studies evaluating 



the effect of DNA extraction in stool and human colon 
specimens demonstrating that inter-subject differences are 
greater than differences introduced by DNA extraction 
protocols (11, 12). On the contrary, comparison of the 
distance among samples based on the 9 Y c metric (Fig. 2B), 
which takes into account species proportions, showed 
a tendency for clustering according to DNA extraction 
method, although AMOVA was still not significant 
(P = 0.19). It is evident, however, that a trend existed for 
sample separation along axis 2, with C samples located 
towards the positive end. We then determined which taxa 
showed a significant correlation (Spearman) with axis 2, 
therefore driving separation of samples along this axis. 
The most significantly correlated OTUs are depicted in 
Fig. 2C. Surprisingly, OTUs that drove samples towards 
the positive end of axis 2 belonged to the Firmicutes and 
were Gram-positive species. Additionally, OTUs that 
drove samples towards the negative end of axis 2 included 
Gram-negative species from the Bacteroidetes and Pro- 
teobacteria and expectedly a Gram-positive OTU from 
the genus Actinomyces. 

To further evaluate bias introduced by employing the 
C and Q methods on supragingival plaque microbial 
profiles, differences in the relative abundance of individual 
taxa were determined. First, however, we acknowledge 
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Fig 2. Effect of two DNA extraction protocols on global profiles of supragingival plaque communities. Samples from seven 
subjects were split and processed with the crude chemical/enzymatic lysis (C) protocol (gray circles) or with the chemical/ 
enzymatic lysis + DNeasy Blood and Tissue kit (Q) method (black circles). Principal coordinate analysis plots show distance 
among samples based on the Jaccard (A) and 0 Y c (B) metrics, which reflect community membership and community structure, 
respectively. Samples belonging to the same subject were encircled. Panel C depicts a biplot also based on 0 Y c distances with 
vectors indicating taxa responsible for separation of data points along axis 2. Shown OTUs had a P<0.01 before multiple 
comparison adjustment. An asterisk indicates OTUs that remained significant after multiple test correction. 



that this study has a low sample size (seven subjects), 
which, in combination with the great inter-subject varia- 
bility of the human microbiome, could understate differ- 
ences. With awareness of this limitation, we present data 
on 33 OTUs that showed a P value of less than 0.05 when 
their relative abundances in both groups were compared 
(Fig. 3 A). No OTU, however, remained significant after 
multiple testing adjustment, and therefore these data 
only indicate trends. In agreement with mock community 
results, Fig. 3A shows that the Q method improved the 
recovery of Actinomyces spp. Interestingly, Gram- negative 
species from the genera Prevotella and Porphyromonas 
were also better recovered after DNA extraction using 
the Q protocol. Moreover, in contrast with the over- 
representation of F nucleatum in mock communities by 
the C method, F nucleatum was seen at greater abundance 
after Q treatment in the more diverse supragingival 
samples. 

Figure 3B shows the mean averages for the phyla 
detected, confirming the results from OTU data in that 
the Firmicutes were greatly over-represented by the crude 
lysis method (C), while the Bacteroidetes, Spirochaetes, 
and Actinobacteria were more efficiently disrupted by the 
method that included lysozyme treatment and column- 
based DNA isolation (Q). The cumulative proportions 
of TM7 also appeared to be higher in the C group. Our 
results are in agreement with other studies of the gut 
microbiome reporting Firmicutes as the phylum with the 



greatest variability according to DNA extraction proce- 
dures (11, 12). In the studies cited, however, Firmicutes 
recovery was improved by utilizing harsher extraction 
methods, while in our study their relative abundance was 
decreased with the harsher protocol, which also included 
lysozyme (Q). This result may be due to increased 
representation of more difficult to lyse species after using 
the Q method (e.g. Actinobacteria), rather than an actual 
change in lysis of Firmicutes, a suggestion that dominant 
oral Firmicutes (e.g. Mitis-group Streptococcus spp. 
and Veillonella spp.) may not be as difficult to lyse as 
Firmicutes of the lower intestinal tract (11, 12). It is also 
worth noting that a great number of oral Firmicutes 
species, such as Veillonella, Dialister; and Selenomonas, 
are Gram-negative and in theory better lysed by Q. These 
results highlight the importance of understanding bias 
introduced by DNA extraction protocols in samples 
specific to the niche of interest. 

In conclusion, we have shown that DNA extraction has 
an impact on microbial profiling, supporting the concern 
that this step may introduce bias in the portrayal of 
community structure. Although the use of lysozyme and a 
column-based extraction procedure improved the recov- 
ery of certain species over a crude lysis protocol, addi- 
tional steps such as bead beating, or inclusion of an 
enzymatic cocktail as recommended by others (24), may 
be necessary to maximize accuracy. No protocol, however, 
is completely unbiased; therefore, consistency among 
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Fig 3. Effect of two DNA extraction protocols on relative proportions of OTUs and phyla in supragingival plaque. Panel A 
depicts a heatmap with 33 OTUs showing a P value of less than 0.05 when comparing relative abundances in paired plaque 
samples extracted via crude lysis (C) or with the chemical/enzymatic lysis + DNeasy Blood and Tissue kit method (Q). OTUs 
increased in C appear in red, and OTUs increased in Q appear in green. No OTU remained significant after multiple test 
adjustment. Bar graph in panel B shows relative abundance for all phyla detected. Phyla with a statistically significant 
differential representation in either C or Q are marked with an asterisk. 



laboratories studying similar sites is required. Our results 
also raise a cautionary note regarding the possibility 
that some methods introduce high proportions of con- 
taminant sequences, and therefore appropriate controls 
should be part of standard operating procedures in 
every laboratory. 
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